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We consider a preferential growth model where particles are added one by one to the system 
consisting of clusters of particles. A new particle can either form a new cluster (with probability q) 
or join an already existing cluster with a probability proportional to the size thereof. We calculate 
' exactly the probability Vi(k,t) that the size of the i-th cluster at time t is k. We analyze the 

asymptotics, the scaling properties of the size distribution and of the mean size as well as the 
relation of our system to recent network models. 
PACS numbers: 05.10.-a, 05.40.-a, 02.50.Cw 



I. INTRODUCTION 



Nonuniform growth is inherently present in a broad class of phenomena including the development of biological 
populations, communication networks or economic systems like incomes of persons or companies Jl|-0. In many cases 
it is obvious to assume that in a system consisting of groups or clusters of units the attachment of a new entity to 
one of the groups depends on the already achieved strength or size of that particular group. Simon Q analysed a 
simple model of this kind where the growth probability was proportional to the cluster size and he gave exact results 
for the time dependent size distribution. Referring to the examples of words in a book or personal incomes Simon 
derived a power law distribution of cluster sizes. Recently, in the search for an explanation of the widely observed 
scale invariance of large networks like the WWW , the Internet or power networks [[| , scientific citation || the 
idea of preferential growth has been applied to evolving graphs . It turned out that such graphs behave remarkably: 
They have "small world" properties || and the distribution of the strength of vertices (number of edges from or to a 
vertex) is scale free, provided that the probability of linking a vertex with a new one is proportional to its strength Q 
This class of models represent a new mechanism for "self-organized criticality" . The idea of preferential growth 
seems to be essential in economic systems too where clustering of companies, e.g., according to their market seem to 
follow such a pattern jl]J . 

, These models have been treated by different tools including simulations, continuum or mean field theories [ p"2| and 
t-H ' exact calculations [Q,[l3| by which information has been accumulated about the asymptotic behavior and the time 
dependence of the global distribution functions. However, much less attention has been paid to the full time-dependent 
solution of the problem. The aim of the present work is to give such a solution of a particular model. 
The paper is organized as follows. In Section II we define the model and the quantities of interest as well as we 
present the basic master equation. In Section III the main steps of the full time dependent analytic solution is given 
and the consequences for the steady state and the integrated distributions are drawn. Section contains the analysis 
about the asymptotics and scaling. In Section V we present a discussion of our results. The paper terminates with 
two appendices containing some details of the calculations. 

o " 



II. MODEL 



We model a growing system which consists of groups of different sizes. At the beginning (t = 1) we have one group 
with one element in it. At each time step we add a new element to the system. With probability p it will belong to 
one of the existing groups. The probability that it joins the i-th group is proportional to the size of the group (ki/N), 
see Fig |l} (The number of elements is equal to the time, N = t, because the system size is rising by one in each time 
step.) With probability q = 1 — p the new element will belong to a new group. 
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FIG. 1. Demonstration of the model. The black point on the top denotes the new incoming element, the boxes on the bottom 
are the groups. 



The process can be described by the following master equation: 

Vi(k, t)=p ^~ / Vi(k - 1, t - 1) +p ( 1 - 



Vi(k,t-1) 



t-1 ' " v " ' ' V *- 1, 

+ (1 - P ) Vi{k, t - 1) + (1 - p) ILi^t - 1) tf fc) i(l - ^i), 



(1) 



where Vi(k,t) is the probability that at time t there are k elements in the group i, and is the probability that 

at time t there are i groups in the system: 



Hi(*)= p 4 - 1 -^ 1 ' " 1 



(i-pY 



(2) 



In the following we introduce some important quantities and their definitions. 

Given the size distribution of the individual groups, Vi(k,t), the size distribution of the total system can be calculated 
as their average: 



»=i 



(3) 



In the long time limit this quantity approximates to a stationary value: P(k) = limt— >oo P(^j t). 
The mean of the i-th group size: 



t-i+l 



(h)(t)= E kVi{k,t). 



(4) 
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The reason that the upper limit of the above sum is not infinity is that Vi{k, t) = if k > t — i + 1. 



III. ANALYTIC CALCULATIONS 
A. Asymptotic distribution of group size 

In the first step we calculate the group size distribution in the asymptotic case, P(fc). 

The exact analytic formula for P(fc) was already calculated in [HJlq], we present it here to see the dependence of the 

exponent on the parameter p. 

If we sum up Eq. (|l|) for i = 1 . . . t, we get: 

t P(fc, t) = (t - 1 - pk) P(fc, t - 1) + p(k - 1) P(k - 1, t - 1) + (1 - p)5 k ,i , (5) 

since: 



2 
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Vi{k, t - 1) = y, tmm - 1) = (* - i)p(*> * - !)• 



t=l i=l 



The stationary behavior of P(fc,<), mentioned in the previous section, can be checked from Eq. (||). Replacing the 
stationary quantity P(fc) into Eq. ml) one gets: 

P(fc) = -pfcP(fc) +p(fc - l)P(fc - 1) + (1 - p)S k<1 , (6) 

which can be solved for P(fc): 
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FIG. 2. Group size distribution in the asymptotic limit, for different p values. 

B. Analytic solution for the individual group size distribution 

In the model the first group has an accentuated role since it always has at least one element because of the initial 
conditions. Therefore the master equation (|l|) for the first group (i = 1) has the following simpler form: 

P 1 (k,t) = V 1 (k,t-i) - kPi(M-i) + (fc-i)7M*-M-i). (8) 

For k = 1 in the above equation on the r.h.s the last term vanishes so the probability Vi(l, t) can be calculated easily: 

V,(lt)= r(t ~ p) (9) 
Y(t)T{l-pY () 

For k > 1 one can prove (see Appendix [A|) that the following equality holds: 

The analytic form of V\{k, t) can be received from Eq. (|lo| ) by multiplying both sides with (— I)' -1 (tZi) an d summing 
up for 2 = 1 . . . k, 



In the case of i > 1 we have to look at the hole Master equation ([j]). In this case the equality ( ^p| ) doesn't hold 
because of the last factor in (jl]). Our assumption is that the probability Vi(k,t) will have a modified form: 



Vi(k,t) 



1=1 



r(f - i P ) 
1-1J r(t)r(i - ip) 
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(12) 



The validity of the above form can be checked by replacing it back in Eq. (Tfl), see Appendix 



C. Mean value of group sizes 



Replacing the analytic formula ([12]) into (Q) one gets: 



k=l 1=1 
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The two sums can be transposed ( J2k=i S/=i = J2i=i 12 



k=l J' 



and 
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so the mean value will have the following form 

i-i+l 



p 6_i (l-p) 



i-1 



(13) 



(14) 



D. Time dependent solution for the group size distribution P(fc, t) 



In Sec. Ill A we calculated the stationary group size distribution directly from the master equation. Now we are 
interested in its dynamic. In order to compute that, we start from the definition (^) of P(fc,t), and replace the 
solution we got for Vi(k, t) in the previous sections (|l2|). 
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r(6)r(i - Zp) /b - 2 
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Transposing the two sums: ^Z*_ 2 X)fj=i = Sb=2 Si=2' an< ^ taking into account that: 



yv r(b)r(i - Zp) _ r(i-zp) r(* + i) 



6=2 



r(b - Zp) 



(i + Zp) r(t-zp) i + zp' 



one finally arrives at the time dependent distribution: 
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(1-P) 



i-2 



(15) 



P(M) = ^(-l)'- 1 



i=i 



I - 1 



1 — p p + lp T(t — Ip) 



i + ip i + ip r(t + i)r(i - ?p) 



P(fc,oo) 



(16) 



In the long time limit we will get back our result (ffl) since the second term in [ • • ■ ] decays for large t values with 
f- 1 "^, and the sum transforms into: 



P(fc,oo) 



i-p r(fc)r(2 + i/p) 
i + p r(/c + i + i/p)' 
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IV. ASYMPTOTIC CASES 



We study the t — » oo limes of Vi(k,t) and (ki)(t). In the analytic formula for Vi(k,t), see. Eq. (|1J), there are two 
components, A(l,p,t) and B(i,l,p,t), that depend on the time. 



v i (k,t) = (i- P y- 1 J2(- 1 ) 1 



_j/fc-i\ r(t-ip) 



/ - iy r(*)r(i - Zp) 



r(6)r(i - ip) - 2 



r(6 - Zp) V* - 2 



„b-i 



A(l,p,t) 

The limes of the first term, A(l,p,t), can be easily calculated 



B(i,l,p,t) 



(17) 



= nr^Tp)' 



-ip 



The second term in the long time limit t^$>i,l will converge to a hypergeometric sum: 

lim B(i,l,p,t) = B(i,l,p) = r( ' )r(1 ~ IP) 2 F 1 (i,i-l;i-lp;p). 

t^oc 1 (l — ip) 

For large time values the only time dependent term in (|l7|) will be t~ lp which in case of large t is a fast decaying 
function of I. So in the case of t 3> k we can assume that only the first term of the sum gives non-negligible component 
for Vi(k,t), 



lim V i (k,t)=r p (l-py- 1 -^- 2 F 1 (i,i-l;i-p;p) + 0(t- 2 P) 

t— >oo I (l — p) 



(18) 



For large i values the above formula simplifies further, because in that case lim^oo 2 F\(i, i — l;i — p;p) ~ (1 — p) \ 



and lim,_ 



•oo r(i-p) 



lim Pi(fc,i) 

t,i—*oo 



(19) 




To study the asymptotic behavior of (fcj)(i) we start from the fact, that for small k values, « (, the individual 
group size distribution, Vi(k,t), can be described by the first term of the sum, see Eq. (|18|), and for larger values 
k > t it has a fast decay, Fig. |j. A cut-off parameter, fc*, can be defined and we can assume that (jij) transforms into 

(ki)(t) re ^> n(fc,t) = n(l,<) fc * (fc * 2 + 1} . (20) 
fc=i 
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FIG. 4. Distribution of individual group size in the long time limit (t — 10 9 ) as a function of the group size 
The definition of k* can be done in many ways. We defined k* as the inflection point of Vi(k. t), hence: 



B(»,4,p) r(l-3p) 
Replacing fc* into (p0|) 



r(i-3p) 2 -Pi(i,i-l;*-4p;p) 



(h)(t) ~ f(l - p)- 1 -^- 2^1 (*, i - 1; * - p;p) 



r(i — 4p) 2^1 («, i - 1;« - 3p;p) 



r(i-3p) 2Ji(i,i-l;i-4p;p) 



(21) 



(22) 



For large i values the above formula gets a simpler form, because in this case lim.t_,oo r(i-p) = * P ' um i^oo r(l-3p) = * P 1 
lim^oo 2 F 1 (i,i- l;i - 3p;p) = lim^^ 2 F 1 (i,i- l;i - 4p;p), lim^oo 2 Fi(i,i - l;i -p;p) ~ (1 -p) 1_s . 



(*i>(*) 



(23) 



V. DISCUSSION 



In this paper we presented a simple preferential growth model consisting of a system of clusters with different sizes. 
We gave exact solutions for the main characteristic quantities as the distribution, Vi(k, t), and the mean value, (fei)(t), 
of the individual group size as well as for the distribution of the average group size, P(fc, t). 

The question rises why are such time dependent quantities of interest since most of the asymptotic scaling behavior 
can be obtained with much less labor. In fact the growth models and network usually provide only a background for 
some dynamic process - an aspect which has not yet paid enough attention to. If there is a strong separation of time 
scales, i.e., the growth is much smaller than the process itself then it is satisfactory to concentrate on the asymptotics 
only. This is probably the case with the Internet or the WWW. However, in some cases such a separation of scales 
could be approximate only or even missing and then the importance of the full time dependence becomes apparent. 
We expect that in certain economic processes this will be the case. 

An important aspect in the asymptotic scaling is universality. Similarly to other preferential growth models, our 
system exhibits nonuniversal parameter dependent scaling: the exponents depend on the parameter q (the probability 
of creating a new group). It is worth mentioning that the examples quoted in the introduction also show a wide 
variety of scaling exponents. Further interesting study would be to analyse a model where this parameter q depends 
on the time of the growth. 

The presented system is not a network, the different groups are not linked to each other. However, for a specific value 
of the parameter, p = 0.5, it can be interpreted as a kind of mean field network model. The clusters then denote the 
different nodes, and the particles are the links. The value p = 0.5 means that in average in every second time step 
one new group and two elements are created (in the odd time steps the new element joins to an old group and in even 
time steps it will create a new group.) The new group is the new node while the two new elements are the two ends 
of the new link, one is pointing to the old node, the other is to the new one. This case corresponds to the Barabasi's 
network model with parameter m = 1 which means that the new node connects to one old sites. For this particular 



G 



parameter choice our results agree with them got for the Barabasi's network model: P(fc) ~ k 3 , see Eq. (^), and 
(h){t) ~ y/t/i, see Eq. @. 
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APPENDIX A: 



We prove the assumption (pL0|) . 

If one multiplies (|J) by (—l) k ^ 1 ( l k 7_\) and sums it up for k = 1 . . . I one gets: 
l / , -, \ i 



fe=i 



k - 



J ?MM) = £(-i) 



-1/^-1 



fc=l 



t-l 



k - 1 



E(- 1 )*" 1 u_ 1 i)*^(*.*- 1 )-E(- 1 )*" l u !)<fr-i>*M/---i-'- i) 

k=l ^ ' k=l 



k-l 



(Al) 



y 



where in the first term (x) we detach the last term of the sum: 



fc=i 



(A2) 



Taking into account that ( j, 1 ) = ^r( k _\) the second term (y) can be rewritten as: 

V = E(-l)^ 1 ( l k l\) (k l)Vi(k - l,t - 1) = - ^(-lf- 1 (I - k) iVW, t - 1) 

k=2 ^ ' fc=l ^ ' 



(A3) 
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y = l ^(-if-'il^-Pi^t-l). (A4) 



fc=i 



Replacing the difference, x — y, back to Eq. ( |Al[ ) one gets the time evolution of the sum: 



I 



fc=l x ' k=l 



which leads us back to our assumption (|f0|). 



APPENDIX B: 



We prove the formula ( p"2|) for V%{k, t) in the case of i > 1, by replacing it into Eq. ([!]). 
The l.h.s of the equation after detaching the last term (b — t) of the sum: 

The first term of the r.h.s: 



t - 1 - k p P j k t-i) = (-i^- M y r ^ ( b - % 

t-i M ' j 1 j r(t) r(6-fep)^i-2i p 



~^^h r(t-i) ^1X^)^-2^ ' (B2) 



Taking into account that (fc — 1) = (fc — I) the second term will be: 



^. (i -M-.,-^|(^-o(j:;)E^| i ^j(j:^. ^ 

The sum of (Q and @ will be: 

^W-l) + ^^-M-l) = 

Tft-fcp) ^ T(b) fb-2\ f k -l\ T(t-lp) ^ T(b) (b-2) 

y > r{t) T(b-kp)\i-2j P > T(t) £j T(b-lp) (i-2) P 

~h { ] UiJTirFWM) ? ' (B4) 



which will be equal to the second term of (Bl). Simplifying with this term the remaining equation: 



^>-'!>i)'-'(U) =G:^- 4 - <■»> 



Which is true, because the sum equals with 5k j.. 
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